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Abstract 

In this paper we study the strengths and limitations of collaborative teams 
of simple agents. In particular, we discuss the efficient use of "ant robots" 
for covering a connected region on the Z 2 grid, whose area is unknown in 

advance, and which expands at a given rate, where n is the initial size of the 

> ; 

connected region. We show that regardless of the algorithm used, and the 
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robots' hardware and software specifications, the minimal number of robots 
required in order for such coverage to be possible is ^}(^/n). In addition, 
we show that when the region expands at a sufficiently slow rate, a team 
of 0(\/n) robots could cover it in at most 0(n 2 lnn) time. This completion 
time can even be achieved by myopic robots, with no ability to directly 
communicate with each other, and where each robot is equipped with a 
memory of size 0(1) bits w.r.t the size of the region (therefore, the robots 
cannot maintain maps of the terrain, nor plan complete paths). Regarding 
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the coverage of non-expanding regions in the grid, we improve the current 
best known result of 0{n 2 ) by demonstrating an algorithm that guarantees 
such a coverage with completion time of O^n 1 ' 5 + n) in the worst case, and 
faster for shapes of perimeter length which is shorter than 0(n). 
Keywords: Collaborative Cleaning, Collaborative Search, Decentralized 
Systems, Grid Search, Expanding Domains 

1. Introduction 

Motivation. In nature, ants, bees or birds often cooperate to achieve com- 
mon goals and exhibit amazing feats of swarming behavior and collaborative 
problem solving. It seems that these animals are "programmed" to interact 
locally in such a way that the desired global behavior will emerge even if 
some individuals of the colony die or fail to carry out their task for some 
other reasons. It is suggested to consider a similar approach to coordinate 
a group of robots without a central supervisor, by using only local interac- 
tions between the robots. When this decentralized approach is used much of 
the communication overhead (characteristic to centralized systems) is saved, 
the hardware of the robots can be fairly simple, and better modularity is 
achieved. A properly designed system should be readily scalable, achieving 
reliability and robustness through redundancy. 

Multi- Agent Robotics and Swarm Robotics. Significant research effort 
has been invested during the last few years in design and simulation of multi- 
agent robotics and intelligent swarm systems, e.g. (1-3). 

Swarm based robotic systems can generally be defined as highly decentral- 
ized collectives, i.e. groups of extremely simple robotic agents, with limited 



2 



communication, computation and sensing abilities, designed to be deployed 
together in order to accomplish various tasks. 

Tasks that have been of particular interest to researchers in recent years 
include synergetic mission planning (4, 5), patrolling (6, 7), fault-tolerant 
cooperation (8, 9), network security (10), swarm control (11, 12), design of 
mission plans (13, 14), role assignment (15-17), multi-robot path planning 
(7, 18, 19), traffic control (20, 21), formation generation (22-24), formation 
keeping (25, 26), exploration and mapping (27-29), target tracking (30, 31) 
and distributed search, intruder detection and surveillance (32, 33). 

Unfortunately, the mathematical / geometrical theory of such multi agent 
systems is far from being satisfactory, as pointed out in (34-37) and many 
other papers. 

Multi Robotics in Dynamic Environments. The vast majority of the 
works mentioned above discuss challenges involving a multi agent system 
operating on static domains. Such models, however, are often too limited 
to capture "real world" problems which, in many cases, involve external ele- 
ment, which may influence their environment, activities and goals. Designing 
robotic agents that can operate in such environments presents a variety of 
mathematical challenges. 

The main difference between algorithms designed for static environments 
and algorithms designed to work in dynamic environments is the fact that the 
agents' knowledge base (either central or decentralized) becomes unreliable, 
due to the changes that take place in the environment. Task allocation, 
cellular decomposition, domain learning and other approaches often used by 
multi agents systems - - all become impractical, at least to some extent. 
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Hence, the agents' behavior must ensure that the agents generate a desired 
effect, regardless the changing environment. 

One example is the use of multi agents for distributed search. While many 
works discuss search after "idle targets", recent works considered dynamic 
targets, meaning targets which while being searched for by the searching 
robots, respond by performing various evasive maneuvers intended to pre- 
vent their interception. This problem, dating back to World War II opera- 
tions research (see e.g. (38, 39)), requires the robotic agents to cope with 
a search area that expands while scanned. The first documented example 
for search in dynamic domains discussed a planar search problem, consider- 
ing the scanning of a corridor between parallel borders. This problem was 
solved in (40) in order to determine optimal strategies for aircraft searching 
for moving ships in a channel. 

A similar problem was presented in (41), where a system consisting of a 
swarm of UAVs (Unmanned Air Vehicles) was designed to search for one or 
more "smart targets" (representing for example enemy units, or alternatively 
a lost friendly unit which should be found and rescued). In this problem the 
objective of the UAVs is to find the targets in the shortest time possible. 
While the swarm comprises relatively simple UAVs, lacking prior knowledge 
of the initial positions of the targets, the targets are assumed to be adversar- 
ial and equipped with strong sensors, capable of telling the locations of the 
UAVs from very long distances. The search strategy suggested in (41) defines 
flying patterns for the UAVs to follow, designed for scanning the (rectangu- 
lar) area in such a way that the targets cannot re-enter areas which were 
already scanned by the swarm without being detected. This problem was 
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further discussed in (42), where an improved decentralized search strategy 
was discussed, demonstrating nearly optimal completion time, compared to 
the theoretical optimum achievable by any search algorithm. 
Collaborative Coverage of Expanding Domains. In this paper we shall 
examine a problem in which a group of ant-like robotic agents must cover an 
unknown region in the grid, that possibly expands over time. This problem 
is also strongly related to the problem of distributed search after mobile 
and evading target (s) (42-44) or the problems discussed under the names of 
"Cops and Robbers" or "Lions and Men" pursuits (45-48). 

We analyze such issues using the results presented in (49-51), concerning 
the Cooperative Cleaners problem, a problem that assumes a regular grid 
of connected 'pixels' / 'tiles' / 'squares' / 'rooms', part of which are 'dirty', 
the 'dirty' pixels forming a connected region of the grid. On this dirty grid 
region several agents move, each having the ability to 'clean' the place (the 
'room', 'tile', 'pixel' or 'square') it is located in. In the dynamic variant 
of this problem a deterministic evolution of the environment in assumed, 
simulating a spreading contamination (or spreading fire). In the spirit of (52) 
we consider simple robots with only a bounded amount of memory (i.e. finite- 
state-machines). 

First, we discuss the collaborative coverage of static grids. We demon- 
strate that the best completion time known to date (0(n 2 ), achievable for 
example using the LRTA* search algorithm (53)) can be improved to guar- 
antee grid coverage in 0(j-n lm5 + n). 

Later, we discuss the problem of covering an expanding domain, namely - 
a region in which "covered" tiles that are adjacent to "uncovered" tiles be- 
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come "uncovered" every once in a while. Note that the grid is infinite, namely 
although initial size of the region is n, it can become much greater over time. 
We show that using any conceivable algorithm, and using as sophisticated 
and potent robotic agents as possible, the minimal number of robots below 
which covering such a region is impossible equals Q(y/n). We then show 
that when the region expands sufficiently slow, specifically — every O(^) 
time steps (where Co is the circumference of the region and where 71 is a 
geometric property of the region, which ranges between 0(1) and O(lnn)), 
a group of 0(v / n) robots can successfully cover the region. Furthermore, we 
demonstrate that in this case a cover time of 0(n 2 lnn) can be guaranteed. 

These results are the first analytic results ever concerning the complexity 
of the number of robots required to cover an expanding grid, as well as for 
the time such a coverage requires. 

2. Related Work 

In general, most of the techniques used for the task of a distributed cov- 
erage use some sort of cellular decomposition. For example, in (54) the area 
to be covered is divided between the agents based on their relative locations. 
In (55) a different decomposition method is being used, which is analytically 
shown to guarantee a complete coverage of the area. Another interesting 
work is presented in (56), discussing two methods for cooperative coverage 
(one probabilistic and the other based on an exact cellular decomposition). 
All of the works mentioned above, however, rely on the assumption that 
the cellular decomposition of the area is possible. This in turn, requires the 
use of memory resources, used for storing the dynamic map generated, the 
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boundaries of the cells, etc'. As the initial size and geometric features of the 
area are generally not assumed to be known in advance, agents equipped with 
merely a constant amount of memory will not be able to use such algorithms. 

Surprisingly, while some existing works concerning distributed (and de- 
centralized) coverage present analytic proofs for the ability of the system 
to guarantee the completion of the task (for example, in (55-57)), most of 
them lack analytic bounds for the coverage time (although in many cases an 
extensive amount of empirical results of this nature are made available by 
extensive simulations). 

An interesting work discussing a decentralized coverage of terrains is pre- 
sented in (58). This work examines domains with non- uniform traversability. 
Completion times are given for the proposed algorithm, which is a general- 
ization of the forest search algorithm (59). In this work, though, the region 
to be searched is assumed to be known in advance — a crucial assumption 
for the search algorithm, which relies on a cell-decomposition procedure. 

A search for analytic results concerning the completion time of ant-robots 
covering an area in the grid revealed only a handful of works. The main result 
in this regard is that of (60, 61), where a swarm of ant-like robots is used 
for repeatedly covering an unknown area, using a real time search method 
called node counting. By using this method, the robots are shown to be able 
to efficiently perform such a coverage mission (using integer markers that are 
placed on the graph's nodes), and analytic bounds for the coverage time are 
discussed. Based on a more general result for strongly connected undirected 
graphs shown in (62, 63), the cover time of teams of ant robots (of a given 
size) that use node counting is shown to be t k {n) = 0(n^™), when tk(n) the 
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cover time of a region of size n using k robots. It should be mentioned though, 
that in (60) the authors clearly state that it is their belief that the coverage 
time for robots using node counting in grids is much smaller. This evaluation 
is also demonstrated experimentally. However, no analytic evidence for this 
was available thus far. 

Another algorithm to be mentioned in this scope is the LRTA* search 
algorithm. This algorithm was first introduced in (53) and its multi-robotics 
variant is shown in (62) to guarantee cover time of undirected connected 
graphs in polynomial time. Specifically, on grids this algorithm is shown to 
guarantee coverage in 0{n 2 ) time (again, using integer markers). 

Vertex- Ant-Walk, a variant of the node counting algorithm is presented in 
(64), is shown to achieve a coverage time of 0(nSa), where 5q is the graph's 
diameter. Specifically, the cover time of regions in the grid is expected to be 
0(n 2 ) (however for various "round" regions, a cover time of approximately 
0(n L5 ) can be achieved). This work is based on a previous work in which a 
cover time of 0{ti 2 8g) was demonstrated (65). 

Another work called Exploration as Graph Construction, provides a cov- 
erage of degree bounded graphs in 0(n 2 ) time, can be found in (66). Here a 
group of limited ant robots explore an unknown graph using special "mark- 
ers". 

Interestingly, a similar performance can be obtained by using the simplest 
algorithm for multi robots navigation, namely — random walk. Although in 
general undirected graphs a group of k random walking robots may require 
up to 0(n 3 ) time, in degree bounded undirected graphs such robots would 
achieve a much faster covering, and more precisely, O 
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regular graphs or degree bounded planar graphs a coverage time of 0(n 2 ) 
can be achieved (68), although in such case there is also a lower bound for 
the coverage time, which equals f2(n(logn) 2 ) (69). 

We next show that the problem of collaborative coverage in static grid do- 
mains can be completed in O^n 1 - 5 + n) time and that collaborative coverage 
of dynamic grid domains can be achieved in 0(n 2 lnn). 

3. The Dynamic Cooperative Cleaners Problem 

Following is a short summary of the Cooperative Cleaners problem, as 
appears in (49) (static variant) and (50, 51) (dynamic variant). 

We shall assume that the time is discrete. Let the undirected graph 
G(V, E) denote a two dimensional integer grid Z 2 , whose vertices (or "tiles") 
have a binary property called 'contamination 1 . Let cont t {y) state the con- 
tamination state of the tile v at time t, taking either the value "on" or 
"off". Let F t be the contaminated sub-graph of G at time t, i.e. : F t = 
{v e G | cont t (v) = on}. We assume that F is a single connected compo- 
nent. Our algorithm will preserve this property along its evolution. 

Let a group of k robots that can move on the grid G (moving from a 
tile to its neighbor in one time step) be placed at time t on F , at point 
p G F t . Each robot is equipped with a sensor capable of telling the con- 
tamination status of all tiles in the digital sphere of diameter 7, surrounding 
the robot (namely, in all the tiles that their Manhattan distance from the 
robot is equal or smaller than 3. See an illustration in Figure 1). A robot 
is also aware of other robots which are located in these tiles, and all the 
robots agree on a common direction. Each tile may contain any number of 
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robots simultaneously. Each robot is equipped with a memory of size 0(1) 
bits 1 . When a robot moves to a tile v, it has the possibility of cleaning this 
tile (i.e. causing cont(v) to become off. The robots do not have any prior 
knowledge of the shape or size of the sub-graph F except that it is a single 
and simply connected component. 
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Figure 1: An illustration of a digital sphere of diameter 7, placed around a robot. 

The contaminated region F t is assumed to be surrounded at its bound- 
ary by a rubber-like elastic barrier, dynamically reshaping itself to fit the 
evolution of the contaminated region over time. This barrier is intended to 
guarantee the preservation of the simple connectivity of F t , crucial for the 
operation of the robots, due to their limited memory. When a robot cleans 
a contaminated tile, the barrier retreats, in order to fit the void previously 
occupied by the cleaned tile. Every d time steps, the contamination spreads. 
That is, if t = nd for some positive integer n, then : 

Vf G F t \/u G 4 — Neighbor s{y) , cont t+ i{u) = on 

1 For counting purposes the agents must be equipped with counters that can store the 
number of agents in their immediate vicinity. This can of course be implemented using 
0(log k) memory. However, throughout the proof of Lemma 5 in (49) it is shown that the 
maximal number of agents that may simultaneously reside in the same tile at any given 
moment is upper bounded by 0(1). Therefore, counting the agents in the immediate 
vicinity can be done using counters of O(l) bits. 
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Here, the term 4 — Neighbor s(v) simply means the four tiles adjacent to 
tile v (namely, the tiles whose Manhattan distance from v equals 1). While 
the contamination spreads, the elastic barrier stretches while preserving the 
simple-connectivity of the region, as demonstrated in Figure 2. For the robots 
who travel along the tiles of F, the barrier signals the boundary of the con- 
taminated region. 
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Figure 2: A demonstration of the barrier expansion process as a result of a contamination 
spread. 



The robots' goal is to clean G by eliminating the contamination entirely. 

It is important to note that no central control is allowed, and that the 
system is fully decentralized (i.e. all robots are identical and no explicit com- 
munication between the robots is allowed). An important advantage of this 
approach, in addition to the simplicity of the robots, is fault-tolerance — even 
if almost all the robots die and evaporate before completion, the remaining 
ones will eventually complete the mission, if possible. 

A Survey of Previous Results. The cooperative cleaners problem was 
previously studied in (49) (static version) and (50, 51) (dynamic version). A 
cleaning algorithm called SWEEP was proposed (used by a decentralized 
group of simple mobile robots, for exploring and cleaning an unknown "con- 
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taminated" sub-grid F, expanding every d time steps) and its performance 
analyzed. 

The SWEEP algorithm is based in a constant traversal of the contami- 
nated region, preserving the connectivity of the region while cleaning all non 
critical points — points which when cleaned disconnect the contaminated 
region. Using this algorithm the agents are guaranteed to stop only upon 
completing their mission. The algorithm can be implemented using only local 
knowledge, and local interactions by immediately adjacent agents. At each 
time step, each agent cleans its current location (assuming it is not a critical 
point), and moves according to a local movement rule, creating the effect 
of a clockwise "sweeping" traversal along the boundary of the contaminated 
region. As a result, the agents "peel" layers from the region, while preserving 
its connectivity, until the region is cleaned entirely. An illustration of two 
agents working according to the protocol can be seen in Figure 3. 




Figure 3: An example of two agents using the SWEEP protocol, at time step 40 (with 
contamination spreading speed d > 40). All the tiles presented were contaminated at time 
0. The black dot denotes the starting point of the agents. The X's mark the critical points 
which are not cleaned. The black tiles are the tiles cleaned by the first agent. The second 
layer of marked tiles represent the tiles cleaned by the second agent. 
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In order to formally describe the SWEEP algorithm, we should first 
define several additional terms. Let r(t) = (ri(t), r 2 (t), . . . , Tk(t)) denote the 
locations of the k agents at time t. In addition, let fj(t) denote the "previous 
location" of agent i. Namely, the last tile that agent i had been at, which is 
different than T,i(t). This is formally defined as : 

n(t) = Ti(x) s.t. x = max{j G N | j < t and r^j) ^ T^t)} 
The term dF denotes the boundary of F, defined via : 

dF = {v | v G F A 8 - Neighbors(v) n (G \ F) ^ 0} 
The term 'rightmost' can now be defined as follows : 

• If t — then select the tile as instructed in Figure 4. 

• If fi(t) G dF t then starting from fi(t) (namely, the previous boundary 
tile that the agent had been in) scan the four neighbors of Tj(£) in a 
clockwise order until a boundary tile (excluding fi(t)) is found. 

• If not fi(t) G dF t then starting from fj(t) scan the four neighbors of 
Ti{t) in a clockwise order until the second boundary tile is found. 

A schematic flowchart of the protocol, describing its major components 
and procedures is presented in Figure 5. The complete pseudo-code of the 
protocol and its sub-routines appears in Figures 6 and 7. Upon initialization 
of the system, the System Initialization procedure is called (defined in Fig- 
ure 6). This procedure sets various initial values of the agents, and calls the 
protocol's main procedure — SWEEP (defined in Figure 7). This procedure 
in turn, uses various sub-routines and functions, all defined in Figure 6. The 
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Figure 4: When t — the first movement of an agent located in (x, y) should be decided 
according to initial contamination status of the neighbors of (x,y), as appears in these 
charts — the agent's initial location is marked with a filled circle while the destination is 
marked with an empty one. All configurations which do not appear in these charts can be 
obtained by using rotations. This definition is needed in order to initialize the traversal 
behavior of the agents in the correct direction. 



SWEEP procedure is comprised of a loop which is executed continuously, 
until detecting one of two possible break conditions. The first, implemented 
in the Check Completion of Mission procedure, is in charge of detecting 
cases where all the contaminated tiles have been cleaned. The second con- 
dition, implemented in the Check Near Completion of Mission procedure, is 
in charge of detecting scenarios in which every contaminated tile contains 
at least a single agent. In this case, the next operation would be a simul- 
taneous cleaning of the entire contaminated tiles. Until these conditions are 
met, each agent goes through the following sequence of commands. First 
each agent calculates its desired destination at the current turn. Then, each 
agent calculated whether it should give a priority to another agent located 
at the same tile, and wishes to move to the same destination. When two or 



14 



more agents are located at the same tile, and wish to move towards the same 
direction, the agent who had entered the tile first gets to leave the tile, while 
the other agents wait. In case several agents had entered the tile at the same 
time, the priority is determined using the Priority function. Before actually 
moving, each agent who had obtained a permission to move, must now locally 
synchronize its movement with its neighbors, in order to avoid simultaneous 
movements which may damage the connectivity of the region. This is done 
using the waiting dependencies mechanism, which is implemented by each 
agent via an internal positioning of itself in a local ordering of his neighbor- 
ing agents. When an agent is not delayed by any other agent, it executes its 
desired movement. It is important to notice that at any given time, waiting 
or resting agents may become active again, if the conditions which made 
them become inactive in the first place, had changed. 

Following are several results that we will later use. While using these 
results, we note that completely cleaning a region is at least as strong as cov- 
ering it, as the number of "uncovered" tiles at any given time can be modeled 
by the number of "contaminated" tiles, since the number of uncovered tiles 
in the original region to be explored is clearly upper bounded at all times by 
the number of remaining contaminated tiles that belong to this region. 

Result 1. (Cleaning a Non-Expanding Contamination) The time it 
takes for a group of K robots using the SWEEP algorithm to clean a region 
F of the grid is at most: 

A 8(\dF \-l).(W(F ) + k) 

''static — } \- £tv 

K 

Here W(F) denotes the depth of the region F (the shortest path from 
some internal point in F to its boundary, for the internal point whose shortest 
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Figure 5: A schematic flow chart of the SWEEP protocol. The smooth lines represent 
the basic flow of the protocol while the dashed lines represent cases in which the flow is 
interrupted. Such interruptions occur when an agent calculates that it must not move 
until other agents do so (either as a result of waiting or resting dependencies — see lines 6 
and 14 of SWEEP for more details). 
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1: System Initialization 

2: Arbitrarily choose a pivot point po in 8Fq, and mark it as critical point 

3: Place all the agents on po 

4: For (i = 1; i < k; i + +) do 

5: Call Agent Reset for agent i 

6: Call SWEEP for agent i 

7: Wait two time steps 

8: End for 

9: End procedure 

10: Agent Reset 

1 1 : resting <— false 

12: dest <— null /* destination */ 

13: near completion -f— false 

14: saturated perimeter <— false 

15: waiting 4— 

16: End procedure 

17: Priority 

18: /* Assuming the agent moved from (xo,yo) to (xi,yi) */ 

19: priority <— 2(x± — xo) + (yi — yo) 

20: End procedure 

21: Check Completion of Mission 

22: If {{x,y) = po) an d (x,y) has no contaminated neighbors then 
23: If (x,y) is contaminated then 

24: Clean (x, y) 

25: STOP 

26: End procedure 

27: Check "Near Completion" of Mission 

28: /* Cases where every tile in Ft contains at least a single agent */ 

29: near completion 4— false 

30: If each of the contaminated neighbors of (x,y) contains at least one agent then 

31: near completion «— true 

32: If each of the contaminated neighbors of (x,y) satisfies near completion then 

33: Clean (x, y) and STOP 

34: /* Cases where every non-critical tile in dFt contains at least 2 agents */ 

35: saturated perimeter <— false 

36: If ((x,y) e dFt) and both (x, y) and all of its non-critical neighbors 

in dFt contain at least two agents then 
37: saturated perimeter <— true 

38: If ((x,y) £ dFt) and both (x,y) and all of its neighbors in dFt has 

saturated perimeter = true then 
39: Ignore resting commands for this time step 

40: End procedure 



1: SWEEP Protocol /* Controls agent i after Agent Reset */ 



2: Check Completion of Mission 

3: Check "Near Completion" of Mission 

4: dest <— rightmost neighbor of (x,y) /* Calculate destination */ 
5: destination signal bits <— dest /* Signaling the desired destination */ 
6: /* Calculate resting dependencies (solves agents' clustering problem) */ 
7: Prom all agents in (x,y) except agent i we define be the following groups: 
8: K\ : Agents signaling towards dest which entered (x,y) before agent i 

9: K2 ■ Agents signaling towards dest which entered (x,y) with agent i, 

and with higher priority than this of agent i 

10: resting 4— false 

11: If (K-l + 0) or (K 2 + 0) then 

12: resting <— true 

13: If (current time-step T did not end yet) then jump to 4 Else jump to 30 

14: waiting <— /* Waiting dependencies (agents synchronization) */ 
15: Let active agent denote a non-resting agent which didn't move in T yet 
16: If (x-l,y) 6 Ft contains an active agent then waiting -s— waiting U {left} 
17: If (x,y-l) g Ft contains an active agent then waiting <— waiting U {down} 
18: If (x-l,y-l) £ Ft contains an active agent then waiting <— waiting U {l-d} 
19: If (x+l,y-l) 6 Ft contains an active agent then waiting <— waiting U {r-d} 
20: If dest = right and (x+l,y) contains an active agent j, and destj ^ left, and 
there are no other agents delayed by agent i (i.e. (x-l,y) does not contain 
active agent I with desti =right and no active agents in (x,y+l),(x+l,y+l), 
(x-l,y+l), and (x+l,y) does not contain active agent n with dest n = left), 
then (waiting <— waiting U {right}) and (waitingj <— waiting^ \ {left}) 
21: If dest = up and (x,y+l) contains an active agent j, and destj ^ down, and 
there are no other agents delayed by agent i (i.e. (x,y-l) docs not contain 
active agent / with desti =up and no active agents in (x+l,y),(x+l,y+l), 
(x-l,y+l), and (x,y+l) does not contain active agent n with dest n = down), 
then (waiting <— waiting U {up}) and (waiting^ <— waiting^ \ {down}) 
22: If (waiting 7^ 0) then 

23: If (T has not ended yet) then jump to 4 Else jump to 30 

24: /* Decide whether or not (x,y) should be cleaned */ 

25: If -1 ((x,y) £ dFt) or ((x,y) = po) or (x,y) has 2 contaminated tiles in its 

^Neighbors which are not connected via a path of contaminated tiles from its 
^Neighbors then 

26: ( x ,y) is an internal point or a critical point and should not be cleaned 

27: Else 

28: Clean (x,y) if and only if it does not still contain other agents 

29: Move to dest 

30: Wait until T ends. 18 
31: Return to 2 



Figure 7: The SWEEP cleaning protocol. 



path is the longest) and as defined above, dF denotes the boundary of F, 
defined via : 

dF = {v | v e F A 8 - Neighbors(v) n (G \ F) ^ 0} 

The term 8 — Neighbor s(v) is used to denote the eight tiles that tile v is 
immediately surrounded by. 

Result 2. (Universal Lower Bound on Contaminated Area) Using 
any cleaning algorithm, the area at time t of a contaminated region that 
expands every d time steps can be recursively lower bounded, as follows : 

S t+d >S t -d-k+ \2y/2 ■ (S t - d ■ k) - 1 

Here S t denotes the area of the contaminated region at time t (such that 
S = n). 

Result 3. (Upper Bound on Cleaning Time for SWEEP on Ex- 
panding Domains) For a group of k robot using the SWEEP algorithm 
to clean a region F on the grid, that expands every d time steps, the time it 
takes the robots to clean F is at most d multiplied by the minimal positive 
value of the following two numbers : 

(A 4 - A 1 A 3 ) ± V /(A 1 A 3 - A 4 ) 2 - AA 3 (A 2 -A 1 - A^) 



where 



2A, 



c + 2 - 72 _ c + 2 + 72 _ 8 • 72 

Al ~ 4 ' M ~ 4 ' M ~ TT 

A 4 = 7i-^7^ , 7i = ^(l + 40-^(1 + ^1) 
a 



7 



72 = V(co + 2) 2 -85o + 8 
8(k + W(F )) d-2k 



k \dF \ 
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Here c is the circumference of the initial region F , and where ij){x) is 
the Digamma function (studied in (70)) — the logarithmic derivative of the 
Gamma function, defined as : 



Note that although c = 0(\dF\) the actual length of the perimeter of the 
region can be greater than its cardinality, as several tiles may be traversed 
more than once. In fact, in (49) it is shown that c < 2 • \dF\ — 2. 

4. Grid Coverage — Analysis 

We note again that when discussing the coverage of regions on the grid, 
either static or expanding, it is enough to show that the region can be cleaned 
by the team of robots, as clearly the cleaned sites are always a subset of the 
visited ones. We first present the cover time of a group of robots operating 
in non-expanding domains, using the SWEEP algorithm. 

Theorem 1. Given a connected region of S = n grid tiles and perimeter c 0; 
that should be covered by a team of k ant-like robots, the robots can cover it 
using O (j.Sq 5 + So) time. 



Proof. Since \dF \ = 6(c ), and W(F ) = 0(^/S^), recalling Result 1 we can 

see that : 




or as : 
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As c = O(S ) and as for practical reasons we assume that k < n this 
would equal in the worse case to : 

1 



t k (n)=t static (k) = O[-S 1 5 + S 



□ 



We now examine the problem of covering expanding domains. The lower 
bound for the number of robots required for completing is as follows. 

Theorem 2. Given a region of size S > 3 tiles, expanding every d time 
steps, then a team of less than robots cannot clean the region, regardless 
of the algorithm used. 



Proof. Recalling Result 2 we can see that : 



St+d — S t > 



2y/2- (S t - d-k)-l 



-d-k 



By assigning k = we can see that 



AS t = S t+d -S t > 



2 a/2 • (S t - y/So) - 1 



For any S > 3, we see that AS'o > 0. In addition, for every > 3 we 
can see that ^ > for every t > 0. Therefore, for every So > 3 the size of 
the region will be forever growing. □ 

Corollary 1. Given a region of size Sq tiles, expanding every d time steps, 
where d = 0(1) w.r.t S , then a team of less than fl(y/^o) robots cannot 
clean the region, regardless of the algorithm used. 
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Theorem 3. Given a region F of size S tiles, expanding every d time steps, 
where R(F) is the perimeter of the bounding rectangle of the region F, then 
a team of k robots that att = are located at the same tile cannot clean the 
region, regardless of the algorithm used, as long as d 2 k < Q(R(F)). 

Proof. For every v G F let l{v) denote the maximal distance between v and 
any of the tiles of F, namely : 

l(v) = ma.x{d(v, u)\u G F} 

Let C(F) = l(v c ) such that v c G F is the tile with minimal value of l(v). 

Let v s denote the tile the agents are located in at t — 0. Let v d G F 
denote some contaminated tile such that d(v s ,Vd) = l(v s ). Regardless of the 
algorithm used by the agents, until some agent reaches Vd there will pass at 
least l(v s ) time steps. Let us assume w.l.o.g that v d is located to the right 
(or of the same horizontal coordinate) and to the top (or of the same vertical 
coordinate) of v s . Then by the time some agent is able to reach Vd there 



exists an upper-right quarter of a digital sphere of radius 
center is Vd- 



i(v a ) 



+ 1, whose 



The number of tiles in such a quarter of digital sphere equals : 



l(v s 
d 



3 

+ 2 



l{Vs 
d 



+ 1 = 



l(Vs) 
d? 



\2 



It is obvious that the region cannot be cleaned until Vd is cleaned. Let 
td denote the time at which the first agent reaches Vd- It is easy to see that 
td > l(v s ). Therefore, regardless of activities of the agents until time step 
td, there are now k agents that has to clean a region of at least ( ^jr ^ 
tiles, spreading every d time steps. Using Theorem 2 we know that k agents 

22 



cannot clean an expanding region of k — ^j 1 tiles. Namely, at time t d k 
agents could not clean the contaminated tiles if : 

d 2 k <Q{l{v s )) 

As l(v s ) > C(F) we know that k agents could not clean an expanding 
contaminated region where : d 2 k < f2 (C(F)). It is easy to see that for every 
region F, if R(F) is the length of the perimeter of the bounding rectangle of 
F then C(F) = Q(R(F)). □ 

Lemma 1. For every connected region of size So > 3 and perimeter of length 
c : 

1 

-c < 72 < c 

Proof, let us assume by contradiction that (c + 2) 2 < (8S0 + 8). This means 



Co < + 8 — 2. However, the minimal circumference of a region of size 

So is achieved when the region is arranged in the form of an 8-connected 
digital sphere, in which case c > ^^/Sq — 4, contradicting the assumption 



that c < V8S0 + 8 - 2 for every S > 5. Therefore, 72 G ffiL 
Let us assume by contradiction that 72 < \cq. Therefore : 



(c + 2) 2 - 8S0 + 8 < ^c 2 



which implies : 



c o < + \ W l S ° ~ 4 < 3-266^ - 2 
o V 3 9 



However, we know that Co > 4v / So — 4, which contradicts the assumption 
that 72 < I c for every S > 3. 
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Let us assume by contradiction that 72 > c . Therefore : 

(c + 2) 2 - 8S + 8 > c 2 

which implies : 

c > 4S - 6 

However, we know that cq < 2 So — 2 (as Co is maximized when the tiles are 
arranged in the form of a straight line), contradicting the assumption that 
72 > c for every S > 3. □ 

Lemma 2. For every connected region of size S > 3 and perimeter of length 
c : 

n(l) < 71 < O(lnn) 

Proof Let us observe 7x : 

From Lemma 1 we can see that 1 < (l + c ° +2 ~ 72 ) < ^co- Note that "0(1) = 
—7 where 7 is the Euler-Mascheroni constant, defined as : 




which equals approximately 0.57721. In addition, ip(x) is monotonically in- 
creasing for every x > 0. As we also know that ip(x) is upper bounded by 
O(lnx) for large values of x, we see that : 

-0.58 < V (l + Co + 2 ~ l2 \ < O(lnn) (4.1) 
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From Lemma 1 we also see that 1 < (l + 00+2+72 ) < ^c meaning that : 
^(l+ C ° + 2 4 + 72 )= 9(lnn) (4.2) 
Combining equations 4.1 and 4.2 we see that : 

<7i < OQnn) (4.3) 

□ 

Theorem 4. Result 3 returns a positive real number for the covering time 
of a region of So tiles that expands every d time steps, when the number of 
robots is 6(v / 3o) and d = fi(^), and where 71 shifts from 0(1) to 0(ln>So) 
as Co grows from O(y/S~o) to O(Sq), defined as : 

7l ^( 1 + £i±l±^)_^ 1 + B!±^) 

72 = V(co + 2) 2 -85o + 8 

Proof. Following are the requirements that must hold in order for Result 3 
to yield a real number : 

• d-k^0 

• \dF\ > 1 

• A 3 ^0 

• (c + 2) 2 > 8S - 8 

• (AA 3 - A 4 f > AA 3 (A 2 -A,- A l A 4 ) 
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The first and second requirements hold for every non trivial scenario. The 
third requirement is implied by the fourth. The fourth assumption is a direct 
result of Lemma 1 . 

As for the last requirement, we ask that : 



A\A\ + A\ > AA 2 A 3 - 4Ai;4 3 - 2A 1 A 3 A A 



which subsequently means that we must have : 



(c 2 + ll - C072) >4 72 

+^2 , _ 77172 ~~ dk 
^ n ^ d d 



\ 



472 - c 7i+ 
coY ~ 27i + 2Y- 
7i72 - $ 



Using Lemma 1 and Lemma 2 we should make sure that : 



ll 2 . 2 . . 2 2 ^ ^ / c o727i . c 7|7\ 

Using VU(F) = O(v^So) and fi(v^o) = W\ = O(S ) and dividing by 7 | 
(which we know to be larger than zero), we can write the above as follows : 



dk' 



+ 



k 2 + dln 2 S 



+ 1 > 



. .'la S k\nS \nS c Q \fS^ c 1 

O ( + q — + — - + , " + -T + 



Co 05 fc dk 2 dk d 

.2 



As c > v 7 ^ then ^ > In addition, 1 > \ and also 1 > ^ and 

1 > ^p 1 (as Eq. 4.6 shows that > InS'o). In order to have also 1 > ^ we 
must have : 



d • k = fl(c ) 



(4.4) 
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In addition, we should also require that the result fj, would be positive (as 
it denotes the coverage time). Namely, that : 

A A + ^/(AAs - A 4 ) 2 - AA 3 (A 2 - M - AiA 4 ) > A ± A 3 

For this to hold we shall merely require that: 

A 2 -A 1 - A X A 4 < 

(as A 3 is known to be positive). Assigning the values of Ai,A 2 ,A^ this 
translates to : 



c + Cq -j < O(c 7i) 



Dividing by Co we can write : 



Namely : 



]_ _|_ d I A: 

1 + c fc - c ° cu < 0( 7 l) 



c + + fe < d0 ( 7l ) 



As c is the dominant element of the left side of the inequation, we see 
that : 

d = n(|) (4.5) 

Assigning this lower bound for d we can now see that : 

^(v^o") < k < 0(cq) (4.6) 
Therefore, we shall select the value of k such that : 

k = 6( v / 5o) 

This also satisfies Equation 4.4. □ 

27 



Theorem 5. The time it takes a group of k = Q(^/S~o) robots using the 
SWEEP algorithm to cover a connected region of size S tiles, that expands 
every d = il(fj-) ^ me s t e P s , i s upper bounded as follows : 



^success — O [Sq In S ) 

where 71 shifts from 0(1) to O(lnSo) as c grows from O(yfSo) to O(So), 
defined as : 



72 = V(co + 2) 2 -85o + 8 



Proof. Recalling Result 1 we know that if : 

8(\dF \-l)-(W(F ) + k) 



+ 2k < d 



k 

then the robots could clean the region before it expands even once. In this 
case, the cleaning time would be O(^\/S^-c + c ) as was shown in Theorem 1. 
Therefore, we shall assume that : 

8(|5F | — 1) • (W(F ) + k) 



k 

Observing Result 3 we see that : 

^success = 



+ 2k>d (4.7) 



.. , |A 4 | / 2 \A 1 A 4 \+A 1 + A 2 A\ 
d -°\ A ^^ + \l A ^ M + If'- 



M V^Wl^i^l + A x + A 2 \A 
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d-0\ A 1 + 



I A, 



AM 



A 3 VA~ 3 
I c + 72 + dkf 2 + A; 7 + 

Using the fact that 72 = O(c ) (Lemma 1) we can rewrite this expression 



d-0 



as : 



d ■ O ^c + c?A;— + A; 7 + Vk\J~dqi + c o7 + \/A^ (4.8) 
Recalling Equation 4.7, and as W(F ) = 0(-\/So), we can see that : 

/So • c 



+ c + k 



Therefore, | 7 | can now be written as : 



n\ = 



+ v/s + 



k 



+ 1 



k ^/So 

In addition, remembering that O(y/So) < c < O(S ) we can rewrite the 
expression of Equation 4.8 as follows : 



d-0 



d-0 



c + ^ + AVSo + ^+ 



r kc Q 



-7- + V^o H 



<■() 



7i + 



c + d/c^ + fcv/So + 



/So 

fc 2 



+ Vkd 



CO 



/^(v^r+^sox/f +^0 + ^) 

Using Lemma 2 we see that : 

c + dA;^ + A;v/So ^ 



+ 



V^ColnSov^ + a/ cq/ca/So + ktfS^ + 



A; 2 



c + fcvSohiSo + 7^+ 



V / c o lnS v / So"+ a/co^VS^ 



(4.9) 
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Assuming that k > O(\nS ) and as c = O(Sq) we can now write : 



O 




(4.10) 



Using Equation 4.6 we can see that this translates to : 



O(S^ 5 k\nS ) 



(4.11) 



□ 



5. Conclusions 

In this paper we have discussed the problem of cleaning or covering a 
connected region on the grid using a collaborate team of simple, finite-state- 
automata robotic agents. We have shown that when the regions are static, 



in the worst case, thus improving the previous results for this problem. In 
addition, we have shown that when the region is expanding in a constant rate 
(which is "slow enough"), a team of Q(y/n) robots can still be guaranteed to 
clean or cover it, in 0(n 2 lnn) time. 

In addition, we have shown that teams of less than Q(y/n) robots can 
never cover a region that expands every 0(1) time steps, regardless of their 
sensing capabilities, communications and memory resources employed, or 
the algorithm used. As to regions that expand slower than every 0(1) time 
steps, we have shown the following impossibility results. First, a region of 
n tiles that expands every d time steps cannot be covered by a group of k 
agents if dk < 0(y/n). Using Theorem 4 we can guarantee a coverage when 



this can be done in 0(^^/n • c + c + k) time which equals 0(|n L5 + n) time 
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dk = or even for dk = Q(n)) (when the region's perimeter c equals 

0(n)). 

Second, a spreading region cannot be covered when d 2 k is smaller than 
the order of the perimeter of the bounding rectangle of the region (which 
equals 0(n) in the worse case and 0(y/n) for shapes with small perimeters). 
Using Theorem 4 we can guarantee a coverage when d 2 k = {l(j^), or for 
d 2 k = f2(n L5 ) (when the region's perimeter Co equals 0(n)). 

We believe that these results can be easily applied to various other prob- 
lems in which a team of agents are required to operate in an expanding grid 
domain. For example, this result can show that a team of 0(v / n) cops can 
always catch a robber (or for that matter — a group of robbers), moving 
(slowly) in an unbounded grid. Alternatively, robbers can be guaranteed to 
escape a team of less than 0(y/n) cops, if the area they are located in is 
unbounded. 
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